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Abstract. The Wide Field X-ray Telescope (WFXT) will provide tens of millions of AGN, 
with more than 4x10 s expected at z> 3. Here we review the issues present in the identifica- 
tion of (large) samples of faint and high-redshift X-ray sources, and describe a statistical, 
powerful tool that can be applied to WFXT catalogs. The depth of associated optical and 
near infrared catalogs, needed for a reliable and as much complete as possible identification, 
are also discussed, along with the combined synergies with existing or planned facilities. 
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1. Scientific drivers 

One of the main aims in extragalactic astron- 
omy for the next decade is the study of the 
co-evolution of galaxies and the Super Massive 
Black Holes (SMBH) residing in their centre, 
out to the very first epochs of galaxy forma- 
tion. In this respect, deep and sensitive X-ray 
observations will be the unique instrument to 
reveal the accretion light from SMBH in galac- 
tic nuclei at high-z, which are often invisible 
at longer wavelengths because of intergalactic 
absorption and dilution by the host galaxy. 

The study of Active Galactic Nuclei (AGN) 
demography at z> 3 is one of the key science 
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drivers for t he Wide Field X-ra y Telescope 
(WFXT, e.g. lForman et al.ll2010h . In the past 
decade, the characterization of the early phase 
of SMBH growth has been limited to the study 
of optically selected QSOs detected mostly in 
the SDSS survey, i.e. sampling only the un- 
obscured and most luminous tail of the AGN 
population. Deep and medium deep Chandra 
and XMM-Newton surveys have allowed the 
study of X-ray selected QSOs up to relatively 
high redshifts, z~ 3-4. At higher redshifts, 
present X-ray surveys are highly incomplete 
and strongly limited by the small area sam- 
pled. As an example, there are only a few 
X-ray selected QSO with confirmed spectro- 
scopic redshifts at z> 5 (see Barger et al.l2005h 
Moreover, the extrapolations of the X-ray lu- 
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minosity function (LF) as obtained combining 
various XMM and Chandra surveys differ by 
up an order of magnitude (see Fi gure 1, and 
refere nce therein, adapted from Bru sa et ail 
l2010h . As a comparison, the number of op- 
tically selected QSOs revealed up to z~ 6 
is approaching 50, i.e. large enough to de- 
termine their LF which encodes the infor- 
mation about the history of SMBH build up 
and the int egrated flux of UV ionizing radi- 
ation (e.g. iFan et al.l l2006t Ijiang et al.ll2009t 
IWillott et alJl2010l and reference therein). An 
unbiased search of X-ray selected z~ 5 - 6 
QSOs would require to survey several hun- 
dreds of square degrees to a depth of the or- 
der of ~ 10~ 15 erg cirT 2 s _1 and thus be- 
yond the capabilities of current X-ray tele- 
scopes. WFXT will offer the unique opportu- 
nity to explore the high-redshift universe, pro- 
viding about two order of magnitudes larger 
samples with respect to the current SDSS sam- 
ples (~ 2000 z> 6 AGN vs. ~ 50), opening 
a completely new, unexplored window for LF 
analysis, (see Gilli et al. 2010, this volume, for 
a full description of the high-redshift AGN de- 
mography with WFXT). 

2. Identification issues 

The identification of the correct counterparts 
of both obscured and unobscured AGN is the 
first, crucial step for a full characterization of 
the physical and evolutionary properties of the 
entire population. At high redshifts, this pro- 
cess is further complicated by the fact that 1) 
z> 3 sources constitute only a tiny fraction 
(~ 1%) of the entire X-ray population (< 0.1% 
for z> 6 sources) and 2) these objects are usu- 
ally faint in the optical band, because the emis- 
sion would be strongly reduced by cosmologi- 
cal dimming, and/or, for obscured sources, the 
intrinsic AGN emission is absorbed by the sur- 
rounding material. As a result, the probabil- 
ity of finding by chance a galaxy of R> 24 
in the X-ray error box is not negligible even 
with Chandra given the high surface density 
of backgroun d galaxies (see e xtensive discus- 
sions in, e.g., iLuo et alfc olO). The identifica- 
tion process is made easier by using deep near 
infrared images given that AGN are strong IR 
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Fig. 1. The number density of logL2-ioi f v > 
44.5 erg s _1 X-ray selected AGN vs. redshift 
as obtained from published LF, as labeled 
dGilli. Comastri & HasingeJ 120071; ISilverman et alj 



20081: lEbrero et alJ 120091 lYencho et al.l l2009r . 
Airdetal.1 l2010h along with datapoints from 
the XMM-COSM OS surveys (green c ircles , 
iBrusa et all l2009d) and from lAird et alj d2010l . 
black squares). The red star at z = 6 represents 
a conservative estimate of the z~ 6 QSO space 
density computed from the optical one assuming 
no evolution of the a ox . For comparison (black 
shaded area) we plo t the results fo r very bright 
QSOs (M 145 o < -27. lFanetaHl2006l, . rescaled by 
an arbitrary factor. 



emitters and the K-band flux is more tightly 
correlated with the X-ray flux than the optical 
(obscured) one (see lBrusa et al.li2009bl) . 

2.1. The likelihood ratio technique 

A statistical, powerful method extensively 
exploited in deep XMM-Newton and Chandra 
surveys in the past years to look for the 
correct counterparts of X-ray sources 
is the "likelihood ra ti o" (L R) technique 
(Suth erland & Saunders! Il992t IBrusa et all 
I2005I) . The method calculates the probability 
that a source is the correct association by 
weighting the information on the X-ray to op- 
tical distance, the surface density of (possible) 
false coincidence background objects and the 
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brightness of the chosen counterpart: 



The object with the hi ghest LR valu Jj] (above 
a cert ain threshold; see lSutherland & Saundersl 
1 19921 for details) is the most likely counterpart; 
when two or more sources have comparable 
LR values, a unique identification is not possi- 
ble and both sources have a similar probability 
of being the correct identification ("ambigu- 
ous" sources). Using catalogs extracted from 
different bands (e.g., optical and infrared) 
may lead to different choices of the correct 
counterparts, and this information should be 
taken into account, too. In the following we 
will show the potentiality (and the challenges) 
on the use of the LR technique applied to 
WFXT data and the multiwavelength datasets 
available. We will make the case separately 
for the Wide (Fo.5-2^3 x 10~ 15 erg crrT 2 
s" 1 ), Medium (F , 5 -2>5 x 10" 16 erg crrT 2 
s -1 ), and Deep (F .5-2c3 x 10" 17 erg crrT 2 
s _1 ) parts of the WFXT survey (Rosati et al. 
2010, this volume), based on the experience 
developed in the framework o f the XMM- 
COSMOS (iHasineer et alJ2007h. C-COS MOS 
dElvis et al Il2009l) and CDFS dLuo et al.ll2008l) 
surveys, where multiwavelength catalogs 
(e.g. optical to mid-infrared) resulted crucial 
to keep the fraction of ambiguous or false 
identification at minimim. 

2.2. Wide and Medium survey: 
COSMOS lessons 

To quantify the expected efficiency of the LR 
technique on the sources detected in the WFXT 
Wide survey, we first limited th e XM M- 
COSMOS sample dCappelluti et all 120091) at 
fluxes larger than the expected limiting flux 
of the WFXT Wide survey (Fo.5-2keV > 
3 x 10~ 15 erg cirT 2 s ) and comparable to 
those expected for t he eROSITA deep survey 
dPredehl et alJl20ld see also Cappelluti et al. 



1 q(m) is the expected probability distribution, as 
a function of magnitude, of the true counterparts, 
/( r) is the probability distribution function of the po- 
sitional errors of the X-ray sources assumed to be a 
two-dimensional Gaussian, and n(m) is the surface 
density of background objects with magnitude in. 



2010, this volume). Then we looked at the 
breakdown of the combined optical and IR 
identifications from the LR technique: 95% of 
the sources have been provided "secure" as- 
sociations, while the remai ning 5% show am- 
biguous counterparts in the lBrusa et alj d2010l) 
catalog. The reliability of the method has 
been tested a posteriori using Chandra, and 
turned out to be 99.6%: only one of the 
245 unique/reliable XMM-COSMOS sources 
at fluxes larger than the WFXT Wide survey re- 
sulted associated to the wrong optical counter- 
part. Moreover, the statistical properties (such 
as redshifts, magnitudes, colors etc.) of the 
primary and secondary counterpart within the 
ambiguous sources are almost indistinguish- 
able, and therefore the choice of the counter- 
part among the two does not in principle affect 
the characterization of the full X-ray popula- 
tion. 

The WFXT Medium survey has been des- 
gigned to cover ~ 3000 deg 2 at fluxes of the or- 
der of ~ 5x 10~ 16 erg cirT 2 s -1 , i.e. comparable 
to th e depth reached by the C-COSMOS sur- 
vey dElvis et alJl2009h .' Following a procedure 
similar to that applied to XMM-COSMOS data 
(see above), and thanks to the smaller (< 1") 
angular resoluti on of Chandra with respect to 
XMM-jVewfow. lCivano et alJ d2010h were able 
to provide secure associations for more than 
95% of the sources detected above the ex- 
pected WFXT Medium survey limit. The frac- 
tion of ambiguous sources in this sample is re- 
duced to ~ 2%, only 1 .4% of the X-ray sources 
are not identified. 

Taking into account that the WFXT posi- 
tional accuracy is expected to be better than 
that of XMM-Newton (HEW=5" - 10" for 
WFXT vs. HEW ~ 15" for XMM-Newton), 
and only slightly worse than the Chandra one 
(HEW ~ 2" when averaged across the FOV), 
we can safely conclude that counterpart iden- 
tification would not be an issue for the WFXT 
Wide and Medium surveys, provided that the 
depth of the optical and IR ancillary data is 
enough to match the X-ray fluxes (see Section 
2.4). 
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Table 1. Optical and IR ideal coverage depth 
for WFXT AGN surveys 



Survey 


plim 
1 0.5-2 
CgS 


I 


K 


Wide 


4X1(T 15 


23.0 


21.5 


Medium 


5xKr 16 


25.0 


23.0 


Deep 


3xl(T 17 


25.5 


23.5 



2.3. Deep survey: CDFS lessons 

The WFXT deep survey (F .5- 2 fev > 3 x 1(T 17 
erg crrT 2 s _1 in the soft X-ray band) would 
be almost as de ep as the CDFS 2Ms survey 
dLuo et al.ll2.008h . over an area that is a factor 
of ~ 1000 larger. The detailed ide ntification 
analy ses for the 2Ms CDFS sources dLuo et al.l 
2010), implementing likelihood ratio match- 
ing accross five bands (R, z, K, 3.6^m and 
1.4 GHz) have shown the power of this ap- 
proach while also quantifying the significant 
challenges in source identification at faint mag- 
nitudes (R> 25). Indeed, it was possible to 
provide identifications for 96% of the X-ray 
sources; among them, 90% have been classi- 
fied as unique/secure, and 10% as ambiguous. 
At these deep X-ray fluxes, the statistical prop- 
erties of primary and secondary counterparts 
between the ambiguous sources are often dif- 
ferent. Moreover, 4% of the sources, despite 
the excellent quality and depth of the mul- 
tiwavelength information available, were not 
associated to any counterpart, i.e. the correct 
counterpart is most likely fainter than the im- 
age depth. This exercise shows that the identi- 
fication of the faintest among the WFXT coun- 
terparts in the deep survey may be challeng- 
ing; In this respect, the HEW goal of 5" is a 
crucial requirement to keep at acceptable val- 
ues the (already not negligible) identifications 
issues and to fully characterize the multiwave- 
length properties of the X-ray sources at the 
highest redshifts. 



2.4. Depth of optical infrared images 

The power of the LR technique described in 
the previous subsections is strongly related to 
the depth of the optical and infrared images 
and catalogs that will be used to identify the 
X-ray sources. The challenge will be to pro- 
vide a homogeneous and (enough) deep cov- 
erage for the different WFXT surveys. At the 
limiting flux of the WFXT wide survey an op- 
tical coverage to I~ 23 and K~ 21.5 would 
be enough to identify ~ 90% of the X-ray 
sources (see Figure 2, upper panels, and Table 
1), but this should be on the entire surveyed 
area. At the time WFXT will be launched, 
PanSTARRS0 will have surveyed ~ 30.000 
deg 2 to I~ 24.2, and will provide imaging in 
at least 5 bands, needed to characterize the 
SED of the X-ray sources and isolate high- 
z candidates (see next Section). On a longer 
timescale, Euclic0 will cover the entire extra- 
galactic sky in the IR down to H~ 24 (roughly 
corresponding to K~ 23), and will provide also 
spectra. The LOw Fr equency ARray (LOFAR, 
iMorpanti et alj 1201 Oh . that will survey the 
northern sky down t o a flux of 0.8 mJy at 120 
MHz (see Fig. 2 in lMorganti et al.l l2010). may 
be crucial to correctly identify radio emitters 
X-ray sources (radio AGN and starbursts). 

PanSTARRS will also provide identifica- 
tion for a substantial fraction (> 50%) of the 
sources detected in the WFXT Medium and 
Deep surveys. In order to identify a fraction as 
large as 90% of the sources in these surveys, a 
coverage in the optical and near-infrared down 
to I~ 25.5 and K~ 23.5 is needed (see Figure 2, 
middle and lower panels, and Table 1). LSST 
is a proposed facility exp ected to coyer th e 
southern sky down to I~ 27 dAbell et alj2010h ; 
similarly to PanSTARRS, LSST will also pro- 
vide multiband photometry at a depth compa- 
rable to the I-band limit. The coordination with 
present and next generation facilities is manda- 
tory, in order to choose the areas for the deep 
surveys which maximize the availability of the 
deepest multiwavelength coverage, in particu- 
lar: JWST0, the PanSTARRS deep survey (I~ 

2 http://pan-starrs.ifa.hawaii.edu 

3 http://sci.esa.int/euclid 

4 http://www.jwst.nasa.gov 
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Fig. 2. I-band (left panels) and K-band (right panel) magnitude distributions expected in the three different 
WFXT surveys (Wide, Medium and Deep, from top to botto m). The expect ed magnitude distrib utions have 
been extracted fro m the XMM-COSMOS jBrusa et aljioloh . C-COSMOS dCivano et al.ll2010h . and CDFS 
dLuo et alj|2010h samples limited to fluxes F 05 _2 > 3 x 10~ 15 erg cirT 2 s -1 , Fo.5-2 > 5 x 10~ 16 erg cnT 2 
s"" 1 and Fo.5_2 > 3 x 10~ 17 erg cm -2 s -1 in order to match the Wide, Medium and Deep limiting fluxes, 
respectively. The dashed lines mark the magnitudes at which most of the sources (90%) are identified. 



28 over 28 deg 2 ), the LSST deep survey (I~ 28 
on a few hundreds deg 2 ), Euclid (K~ 25.5 on 
40 deg 2 ), the VISTA VIDEO survey (down to 
K=23.5 over 15 deg 2 ). 

3. Selecting z> 3 (or z> 6) AGN 

Photometric redshifts of X-ray selected faint 
sources (R= 24 - 27) are essential for en- 
abling science analyses and planning deep 
spectroscopy, and resulted crucial in isolat- 
ing the high-z population in, e.g., XMM- 
COSMOS and CDFS. A similar, detailed 



source characterization requiring multiband 
imaging may be feasible only for small sam- 
ples of WFXT sources. In this context, key 
resources will be again the upcoming LSST, 
PanSTARRS, Euclid, and (as far as spec- 
troscopy for the Wide survey is concerned) the 
SDSSIII-BOSSEI project. However, the high- 
z population shows on average fainter mag- 
nitudes than the overall X-ray source pop - 
ulation (see Figure 3 in Bru sa et al . 2009a), 
and therefore may remain among the uniden- 
tified population, if deep enough optical and 

5 http://www.sdss3.org/boss.php 



Brusa: Identifications in WFXT 



287 



near infrared coverage is not provided over 
the full area Another possibility is to search 
for X-ray counterparts on preselected high-z 
QSO on the basis of opti cal colours and/or 
dropouts technique s (e.g. ICasev et all 120081 : 
Stei del et all [2003), extended including the 
near-infrared ba nds in order to samp le the z> 6 
population re.g. lWillott et all feolO). In this re- 
spect, the unprecedent combination of depth 
and area of WFXT will result in a much bet- 
ter characterization of the physical properties 
(such as bolometric luminosity and accretion 
rate) of the first accreting supermassive black 
holes. Moreover, z> 6 color selections suf- 
fer from significant contamination stellar ob- 
jects (brown dwarfs are overwhelmingly more 
abundant and the spectroscopy success rate 
for z> 6 QSOs is only ~ 20%). The com- 
plete SED characterization from NIR to X- 
ray will be able to resolve issues on contam- 
ination and completeness. For a non negligi- 
ble fraction of the high-z candidates (a few out 
of a few hundreds, see also Matt et al. 2010, 
this volume), redshifts may be directly mea- 
sured from the FeKo; line (see examples in 
ICivano. Comastri"& Brusa 2005). 

4. Conclusions 

• WFXT will provide orders of magnitudes 
larger samples of high-redshift (z> 6) 
AGN compared to current (e.g. SDSS) op- 
tical surveys; 

• the counterpart identification for WFXT 
sources selected in the Wide survey will 
be relatively easy, synergies with present 
and future large area / all sky facilities (e.g. 
PanSTARRs, LSST, Euclid) are pursued; 

• the secure identification of the counter- 
parts detected in the WFXT Medium and 
Deep surveys would greatly benefit of the 
smallest possible angular resolution (the 
5" HEW goal is really auspicable) and 
should heavily rely in the coordination with 
the future optical and NIR deep survey area 
(e.g. LSST, JWST); 

• multiwavelength information is mandatory 
in order to get the redshift and the physical 
properties of the high-z AGN in the WFXT 
surveys. 
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